Overview

Dataset statistics

Number of variables16
Number of observations2777
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory347.2 KiB
Average record size in memory128.0 B

Variable types

Numeric16

Alerts

gross_revenue is highly correlated with qnt_purchases and 3 other fieldsHigh correlation
qnt_purchases is highly correlated with gross_revenue and 2 other fieldsHigh correlation
var_products is highly correlated with gross_revenue and 3 other fieldsHigh correlation
qnt_items is highly correlated with gross_revenue and 3 other fieldsHigh correlation
avg_ticket is highly correlated with avg_basket_varietyHigh correlation
avg_recency_days is highly correlated with freq_purchaseHigh correlation
freq_purchase is highly correlated with avg_recency_daysHigh correlation
qtd_returned is highly correlated with freq_returns and 2 other fieldsHigh correlation
freq_returns is highly correlated with qtd_returned and 2 other fieldsHigh correlation
avg_basket_size is highly correlated with gross_revenue and 1 other fieldsHigh correlation
avg_basket_variety is highly correlated with var_products and 1 other fieldsHigh correlation
item_rp_ratio is highly correlated with qtd_returned and 2 other fieldsHigh correlation
net_margin is highly correlated with qtd_returned and 2 other fieldsHigh correlation
gross_revenue is highly correlated with qnt_purchases and 1 other fieldsHigh correlation
qnt_purchases is highly correlated with gross_revenue and 2 other fieldsHigh correlation
var_products is highly correlated with qnt_purchasesHigh correlation
qnt_items is highly correlated with gross_revenue and 2 other fieldsHigh correlation
avg_ticket is highly correlated with qtd_returned and 1 other fieldsHigh correlation
qtd_returned is highly correlated with avg_ticketHigh correlation
avg_basket_size is highly correlated with qnt_items and 1 other fieldsHigh correlation
item_rp_ratio is highly correlated with net_marginHigh correlation
net_margin is highly correlated with item_rp_ratioHigh correlation
gross_revenue is highly correlated with qnt_purchases and 2 other fieldsHigh correlation
qnt_purchases is highly correlated with gross_revenue and 2 other fieldsHigh correlation
var_products is highly correlated with gross_revenue and 2 other fieldsHigh correlation
qnt_items is highly correlated with gross_revenue and 3 other fieldsHigh correlation
avg_recency_days is highly correlated with freq_purchaseHigh correlation
freq_purchase is highly correlated with avg_recency_daysHigh correlation
qtd_returned is highly correlated with freq_returns and 2 other fieldsHigh correlation
freq_returns is highly correlated with qtd_returned and 2 other fieldsHigh correlation
avg_basket_size is highly correlated with qnt_itemsHigh correlation
item_rp_ratio is highly correlated with qtd_returned and 2 other fieldsHigh correlation
net_margin is highly correlated with qtd_returned and 2 other fieldsHigh correlation
df_index is highly correlated with avg_recency_daysHigh correlation
gross_revenue is highly correlated with qnt_purchases and 4 other fieldsHigh correlation
qnt_purchases is highly correlated with gross_revenue and 4 other fieldsHigh correlation
var_products is highly correlated with gross_revenue and 3 other fieldsHigh correlation
qnt_items is highly correlated with gross_revenue and 4 other fieldsHigh correlation
avg_ticket is highly correlated with qtd_returned and 2 other fieldsHigh correlation
avg_recency_days is highly correlated with df_indexHigh correlation
qtd_returned is highly correlated with gross_revenue and 5 other fieldsHigh correlation
avg_basket_size is highly correlated with gross_revenue and 4 other fieldsHigh correlation
item_rp_ratio is highly correlated with net_marginHigh correlation
net_margin is highly correlated with avg_ticket and 1 other fieldsHigh correlation
avg_ticket is highly skewed (γ1 = 27.69775288) Skewed
freq_purchase is highly skewed (γ1 = 46.11035601) Skewed
qtd_returned is highly skewed (γ1 = 21.64144035) Skewed
df_index has unique values Unique
customer_id has unique values Unique
recency_days has 33 (1.2%) zeros Zeros
qtd_returned has 1484 (53.4%) zeros Zeros
freq_returns has 1484 (53.4%) zeros Zeros
item_rp_ratio has 1484 (53.4%) zeros Zeros

Reproduction

Analysis started2021-10-20 02:30:24.342956
Analysis finished2021-10-20 02:30:49.129515
Duration24.79 seconds
Software versionpandas-profiling v3.1.0
Download configurationconfig.json

Variables

df_index
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct2777
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2253.89197
Minimum0
Maximum5705
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:49.191914image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile181.8
Q1903
median2063
Q33415
95-th percentile4964.2
Maximum5705
Range5705
Interquartile range (IQR)2512

Descriptive statistics

Standard deviation1528.418125
Coefficient of variation (CV)0.6781239498
Kurtosis-0.9554136096
Mean2253.89197
Median Absolute Deviation (MAD)1242
Skewness0.3797900471
Sum6259058
Variance2336061.965
MonotonicityStrictly increasing
2021-10-19T23:30:49.281983image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01
 
< 0.1%
29121
 
< 0.1%
28981
 
< 0.1%
29011
 
< 0.1%
29021
 
< 0.1%
29061
 
< 0.1%
29071
 
< 0.1%
29081
 
< 0.1%
29091
 
< 0.1%
29111
 
< 0.1%
Other values (2767)2767
99.6%
ValueCountFrequency (%)
01
< 0.1%
11
< 0.1%
21
< 0.1%
31
< 0.1%
41
< 0.1%
51
< 0.1%
61
< 0.1%
71
< 0.1%
81
< 0.1%
91
< 0.1%
ValueCountFrequency (%)
57051
< 0.1%
56951
< 0.1%
56891
< 0.1%
56641
< 0.1%
56581
< 0.1%
56471
< 0.1%
56461
< 0.1%
56291
< 0.1%
56281
< 0.1%
56191
< 0.1%

customer_id
Real number (ℝ≥0)

UNIQUE

Distinct2777
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15284.17033
Minimum12347
Maximum18287
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:49.375757image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum12347
5-th percentile12625.8
Q113815
median15240
Q316779
95-th percentile17950.2
Maximum18287
Range5940
Interquartile range (IQR)2964

Descriptive statistics

Standard deviation1715.038366
Coefficient of variation (CV)0.1122101055
Kurtosis-1.205963417
Mean15284.17033
Median Absolute Deviation (MAD)1483
Skewness0.0167561014
Sum42444141
Variance2941356.595
MonotonicityNot monotonic
2021-10-19T23:30:49.468721image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
178501
 
< 0.1%
141631
 
< 0.1%
177041
 
< 0.1%
169331
 
< 0.1%
137721
 
< 0.1%
162491
 
< 0.1%
141981
 
< 0.1%
139891
 
< 0.1%
179301
 
< 0.1%
144821
 
< 0.1%
Other values (2767)2767
99.6%
ValueCountFrequency (%)
123471
< 0.1%
123481
< 0.1%
123521
< 0.1%
123561
< 0.1%
123581
< 0.1%
123591
< 0.1%
123601
< 0.1%
123621
< 0.1%
123631
< 0.1%
123641
< 0.1%
ValueCountFrequency (%)
182871
< 0.1%
182831
< 0.1%
182821
< 0.1%
182731
< 0.1%
182721
< 0.1%
182701
< 0.1%
182651
< 0.1%
182631
< 0.1%
182611
< 0.1%
182601
< 0.1%

gross_revenue
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct2763
Distinct (%)99.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2842.127526
Minimum36.56
Maximum279138.02
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:49.563437image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum36.56
5-th percentile264.584
Q1627.13
median1166.77
Q32420.84
95-th percentile7467.416
Maximum279138.02
Range279101.46
Interquartile range (IQR)1793.71

Descriptive statistics

Standard deviation10459.57225
Coefficient of variation (CV)3.680191038
Kurtosis373.3049261
Mean2842.127526
Median Absolute Deviation (MAD)684.76
Skewness17.10907554
Sum7892588.14
Variance109402651.7
MonotonicityNot monotonic
2021-10-19T23:30:49.652727image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1025.442
 
0.1%
1353.742
 
0.1%
745.062
 
0.1%
379.652
 
0.1%
734.942
 
0.1%
2092.322
 
0.1%
178.962
 
0.1%
889.932
 
0.1%
731.92
 
0.1%
2053.022
 
0.1%
Other values (2753)2757
99.3%
ValueCountFrequency (%)
36.561
< 0.1%
521
< 0.1%
52.21
< 0.1%
62.431
< 0.1%
68.841
< 0.1%
70.021
< 0.1%
77.41
< 0.1%
84.651
< 0.1%
90.31
< 0.1%
93.351
< 0.1%
ValueCountFrequency (%)
279138.021
< 0.1%
259657.31
< 0.1%
194550.791
< 0.1%
140450.721
< 0.1%
124564.531
< 0.1%
117379.631
< 0.1%
91062.381
< 0.1%
72882.091
< 0.1%
66653.561
< 0.1%
65039.621
< 0.1%

recency_days
Real number (ℝ≥0)

ZEROS

Distinct252
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56.75693194
Minimum0
Maximum372
Zeros33
Zeros (%)1.2%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:49.747566image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q110
median29
Q373
95-th percentile211
Maximum372
Range372
Interquartile range (IQR)63

Descriptive statistics

Standard deviation68.4423255
Coefficient of variation (CV)1.20588487
Kurtosis3.405942763
Mean56.75693194
Median Absolute Deviation (MAD)24
Skewness1.891632142
Sum157614
Variance4684.35192
MonotonicityNot monotonic
2021-10-19T23:30:49.841452image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
199
 
3.6%
487
 
3.1%
285
 
3.1%
385
 
3.1%
876
 
2.7%
1067
 
2.4%
966
 
2.4%
765
 
2.3%
1762
 
2.2%
2255
 
2.0%
Other values (242)2030
73.1%
ValueCountFrequency (%)
033
 
1.2%
199
3.6%
285
3.1%
385
3.1%
487
3.1%
543
1.5%
765
2.3%
876
2.7%
966
2.4%
1067
2.4%
ValueCountFrequency (%)
3721
 
< 0.1%
3661
 
< 0.1%
3601
 
< 0.1%
3583
0.1%
3541
 
< 0.1%
3371
 
< 0.1%
3362
0.1%
3341
 
< 0.1%
3332
0.1%
3301
 
< 0.1%

qnt_purchases
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct55
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6.049333813
Minimum2
Maximum206
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:49.939524image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2
Q12
median4
Q36
95-th percentile17
Maximum206
Range204
Interquartile range (IQR)4

Descriptive statistics

Standard deviation9.067395776
Coefficient of variation (CV)1.498908153
Kurtosis184.1067332
Mean6.049333813
Median Absolute Deviation (MAD)2
Skewness10.62910516
Sum16799
Variance82.21766616
MonotonicityNot monotonic
2021-10-19T23:30:50.038550image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2782
28.2%
3500
18.0%
4393
14.2%
5237
 
8.5%
6173
 
6.2%
7138
 
5.0%
898
 
3.5%
969
 
2.5%
1055
 
2.0%
1154
 
1.9%
Other values (45)278
 
10.0%
ValueCountFrequency (%)
2782
28.2%
3500
18.0%
4393
14.2%
5237
 
8.5%
6173
 
6.2%
7138
 
5.0%
898
 
3.5%
969
 
2.5%
1055
 
2.0%
1154
 
1.9%
ValueCountFrequency (%)
2061
< 0.1%
1991
< 0.1%
1241
< 0.1%
971
< 0.1%
912
0.1%
861
< 0.1%
721
< 0.1%
622
0.1%
601
< 0.1%
571
< 0.1%

var_products
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct467
Distinct (%)16.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129.6622254
Minimum2
Maximum7838
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:50.139139image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile10
Q134
median72
Q3143
95-th percentile399.6
Maximum7838
Range7836
Interquartile range (IQR)109

Descriptive statistics

Standard deviation277.6455126
Coefficient of variation (CV)2.141298375
Kurtosis337.1576638
Mean129.6622254
Median Absolute Deviation (MAD)45
Skewness15.35610984
Sum360072
Variance77087.03068
MonotonicityNot monotonic
2021-10-19T23:30:50.233631image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2838
 
1.4%
3534
 
1.2%
2730
 
1.1%
2630
 
1.1%
2930
 
1.1%
3128
 
1.0%
1527
 
1.0%
1927
 
1.0%
2527
 
1.0%
3326
 
0.9%
Other values (457)2480
89.3%
ValueCountFrequency (%)
211
0.4%
312
0.4%
416
0.6%
516
0.6%
624
0.9%
714
0.5%
813
0.5%
919
0.7%
1019
0.7%
1123
0.8%
ValueCountFrequency (%)
78381
< 0.1%
56731
< 0.1%
50951
< 0.1%
45801
< 0.1%
26981
< 0.1%
23791
< 0.1%
20601
< 0.1%
18181
< 0.1%
16731
< 0.1%
16371
< 0.1%

qnt_items
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct1638
Distinct (%)59.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1670.081743
Minimum2
Maximum196844
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:50.332516image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile119.8
Q1330
median703
Q31478
95-th percentile4607
Maximum196844
Range196842
Interquartile range (IQR)1148

Descriptive statistics

Standard deviation5886.62708
Coefficient of variation (CV)3.524753866
Kurtosis486.1295226
Mean1670.081743
Median Absolute Deviation (MAD)451
Skewness18.18931357
Sum4637817
Variance34652378.38
MonotonicityNot monotonic
2021-10-19T23:30:50.429058image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
31011
 
0.4%
2468
 
0.3%
1508
 
0.3%
3007
 
0.3%
2197
 
0.3%
4937
 
0.3%
2607
 
0.3%
12007
 
0.3%
2727
 
0.3%
5167
 
0.3%
Other values (1628)2701
97.3%
ValueCountFrequency (%)
21
< 0.1%
161
< 0.1%
171
< 0.1%
191
< 0.1%
201
< 0.1%
251
< 0.1%
272
0.1%
301
< 0.1%
321
< 0.1%
332
0.1%
ValueCountFrequency (%)
1968441
< 0.1%
802631
< 0.1%
773731
< 0.1%
699931
< 0.1%
645491
< 0.1%
641241
< 0.1%
633121
< 0.1%
583431
< 0.1%
578851
< 0.1%
502551
< 0.1%

avg_ticket
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED

Distinct2775
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.08667366
Minimum2.150588235
Maximum4453.43
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:50.528649image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum2.150588235
5-th percentile4.853447184
Q112.44512195
median17.94344444
Q324.99947917
95-th percentile87.58663158
Maximum4453.43
Range4451.279412
Interquartile range (IQR)12.55435722

Descriptive statistics

Standard deviation107.5551384
Coefficient of variation (CV)3.352018959
Kurtosis1056.119771
Mean32.08667366
Median Absolute Deviation (MAD)6.312279396
Skewness27.69775288
Sum89104.69275
Variance11568.1078
MonotonicityNot monotonic
2021-10-19T23:30:50.618552image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
4.1622
 
0.1%
14.478333332
 
0.1%
18.152222221
 
< 0.1%
30.881
 
< 0.1%
32.597751
 
< 0.1%
19.030483871
 
< 0.1%
28.554516131
 
< 0.1%
12.800681821
 
< 0.1%
6.3962146891
 
< 0.1%
26.087971011
 
< 0.1%
Other values (2765)2765
99.6%
ValueCountFrequency (%)
2.1505882351
< 0.1%
2.43251
< 0.1%
2.4623711341
< 0.1%
2.5112413791
< 0.1%
2.5153333331
< 0.1%
2.651
< 0.1%
2.6569318181
< 0.1%
2.7075982531
< 0.1%
2.7606215721
< 0.1%
2.7704641911
< 0.1%
ValueCountFrequency (%)
4453.431
< 0.1%
1687.21
< 0.1%
952.98751
< 0.1%
872.131
< 0.1%
841.02144931
< 0.1%
651.16833331
< 0.1%
6401
< 0.1%
624.41
< 0.1%
615.751
< 0.1%
602.45313231
< 0.1%

avg_recency_days
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct1155
Distinct (%)41.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean78.72774209
Minimum1
Maximum366
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:50.709589image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13
Q134.22222222
median59
Q399
95-th percentile224
Maximum366
Range365
Interquartile range (IQR)64.77777778

Descriptive statistics

Standard deviation66.46046693
Coefficient of variation (CV)0.8441810366
Kurtosis3.693037322
Mean78.72774209
Median Absolute Deviation (MAD)30
Skewness1.831604949
Sum218626.9398
Variance4416.993664
MonotonicityNot monotonic
2021-10-19T23:30:50.806008image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
7021
 
0.8%
4618
 
0.6%
5517
 
0.6%
4916
 
0.6%
3116
 
0.6%
9116
 
0.6%
2115
 
0.5%
3515
 
0.5%
4215
 
0.5%
2614
 
0.5%
Other values (1145)2614
94.1%
ValueCountFrequency (%)
19
0.3%
24
0.1%
2.8615384621
 
< 0.1%
36
0.2%
3.3303571431
 
< 0.1%
3.3513513511
 
< 0.1%
45
0.2%
4.1910112361
 
< 0.1%
4.2758620691
 
< 0.1%
4.51
 
< 0.1%
ValueCountFrequency (%)
3661
 
< 0.1%
3651
 
< 0.1%
3641
 
< 0.1%
3631
 
< 0.1%
3572
0.1%
3561
 
< 0.1%
3552
0.1%
3521
 
< 0.1%
3512
0.1%
3503
0.1%

freq_purchase
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
SKEWED

Distinct1225
Distinct (%)44.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04969445942
Minimum0.005449591281
Maximum17
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:50.908460image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0.005449591281
5-th percentile0.008746355685
Q10.01578947368
median0.0243902439
Q30.04166666667
95-th percentile0.1153846154
Maximum17
Range16.99455041
Interquartile range (IQR)0.02587719298

Descriptive statistics

Standard deviation0.3374125243
Coefficient of variation (CV)6.789741316
Kurtosis2299.003285
Mean0.04969445942
Median Absolute Deviation (MAD)0.01069161377
Skewness46.11035601
Sum138.0015138
Variance0.1138472115
MonotonicityNot monotonic
2021-10-19T23:30:51.002177image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.062518
 
0.6%
0.0277777777817
 
0.6%
0.0238095238116
 
0.6%
0.0833333333315
 
0.5%
0.0909090909115
 
0.5%
0.0344827586215
 
0.5%
0.0294117647114
 
0.5%
0.0192307692313
 
0.5%
0.0256410256413
 
0.5%
0.0212765957413
 
0.5%
Other values (1215)2628
94.6%
ValueCountFrequency (%)
0.0054495912811
 
< 0.1%
0.0054644808741
 
< 0.1%
0.0054794520551
 
< 0.1%
0.0054945054951
 
< 0.1%
0.0055865921792
0.1%
0.0056022408961
 
< 0.1%
0.0056179775282
0.1%
0.005665722381
 
< 0.1%
0.0056818181822
0.1%
0.0056980056983
0.1%
ValueCountFrequency (%)
171
 
< 0.1%
31
 
< 0.1%
21
 
< 0.1%
1.1428571431
 
< 0.1%
18
0.3%
0.751
 
< 0.1%
0.66666666673
 
0.1%
0.5508021391
 
< 0.1%
0.53351206431
 
< 0.1%
0.53
 
0.1%

qtd_returned
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
SKEWED
ZEROS

Distinct204
Distinct (%)7.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean34.92401873
Minimum0
Maximum9014
Zeros1484
Zeros (%)53.4%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:51.100103image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q39
95-th percentile96.4
Maximum9014
Range9014
Interquartile range (IQR)9

Descriptive statistics

Standard deviation290.507711
Coefficient of variation (CV)8.318278413
Kurtosis572.5647366
Mean34.92401873
Median Absolute Deviation (MAD)0
Skewness21.64144035
Sum96984
Variance84394.73018
MonotonicityNot monotonic
2021-10-19T23:30:51.190505image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01484
53.4%
1129
 
4.6%
2118
 
4.2%
382
 
3.0%
472
 
2.6%
663
 
2.3%
555
 
2.0%
1245
 
1.6%
839
 
1.4%
938
 
1.4%
Other values (194)652
23.5%
ValueCountFrequency (%)
01484
53.4%
1129
 
4.6%
2118
 
4.2%
382
 
3.0%
472
 
2.6%
555
 
2.0%
663
 
2.3%
738
 
1.4%
839
 
1.4%
938
 
1.4%
ValueCountFrequency (%)
90141
< 0.1%
80041
< 0.1%
44271
< 0.1%
37681
< 0.1%
33321
< 0.1%
28781
< 0.1%
20221
< 0.1%
20121
< 0.1%
17761
< 0.1%
15941
< 0.1%

freq_returns
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct424
Distinct (%)15.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2604033399
Minimum0
Maximum4
Zeros1484
Zeros (%)53.4%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:51.283761image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.2857142857
95-th percentile1
Maximum4
Range4
Interquartile range (IQR)0.2857142857

Descriptive statistics

Standard deviation0.4445423736
Coefficient of variation (CV)1.707130076
Kurtosis2.092084746
Mean0.2604033399
Median Absolute Deviation (MAD)0
Skewness1.48297621
Sum723.140075
Variance0.1976179219
MonotonicityNot monotonic
2021-10-19T23:30:51.387941image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01484
53.4%
1665
23.9%
210
 
0.4%
0.025641025647
 
0.3%
0.57
 
0.3%
0.28571428577
 
0.3%
0.256
 
0.2%
0.0094786729865
 
0.2%
0.22222222225
 
0.2%
0.019607843145
 
0.2%
Other values (414)576
 
20.7%
ValueCountFrequency (%)
01484
53.4%
0.0055710306411
 
< 0.1%
0.0056818181821
 
< 0.1%
0.0058651026391
 
< 0.1%
0.0059347181011
 
< 0.1%
0.0059523809521
 
< 0.1%
0.0060240963861
 
< 0.1%
0.0060422960731
 
< 0.1%
0.0061728395061
 
< 0.1%
0.0061919504641
 
< 0.1%
ValueCountFrequency (%)
41
 
< 0.1%
31
 
< 0.1%
210
 
0.4%
1665
23.9%
0.751
 
< 0.1%
0.66666666673
 
0.1%
0.57
 
0.3%
0.42857142861
 
< 0.1%
0.44
 
0.1%
0.33333333331
 
< 0.1%

avg_basket_size
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct1939
Distinct (%)69.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean231.4362258
Minimum1
Maximum6009.333333
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:51.488129image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile45
Q1103.3333333
median172
Q3278.2
95-th percentile585.1
Maximum6009.333333
Range6008.333333
Interquartile range (IQR)174.8666667

Descriptive statistics

Standard deviation261.5699806
Coefficient of variation (CV)1.130203276
Kurtosis115.6198866
Mean231.4362258
Median Absolute Deviation (MAD)81
Skewness7.719257268
Sum642698.399
Variance68418.85475
MonotonicityNot monotonic
2021-10-19T23:30:51.578881image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10011
 
0.4%
869
 
0.3%
758
 
0.3%
608
 
0.3%
2087
 
0.3%
1367
 
0.3%
1057
 
0.3%
737
 
0.3%
1977
 
0.3%
827
 
0.3%
Other values (1929)2699
97.2%
ValueCountFrequency (%)
11
< 0.1%
3.3333333331
< 0.1%
5.3333333331
< 0.1%
5.6666666671
< 0.1%
6.1428571431
< 0.1%
7.51
< 0.1%
91
< 0.1%
9.51
< 0.1%
111
< 0.1%
11.8751
< 0.1%
ValueCountFrequency (%)
6009.3333331
< 0.1%
3868.651
< 0.1%
28801
< 0.1%
2733.9444441
< 0.1%
2518.7692311
< 0.1%
2160.3333331
< 0.1%
2082.2258061
< 0.1%
20001
< 0.1%
1903.51
< 0.1%
1866.9333331
< 0.1%

avg_basket_variety
Real number (ℝ≥0)

HIGH CORRELATION

Distinct897
Distinct (%)32.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.14209043
Minimum0.2
Maximum177
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:51.673125image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0.2
5-th percentile2
Q17.545454545
median13.5
Q322
95-th percentile45.05
Maximum177
Range176.8
Interquartile range (IQR)14.45454545

Descriptive statistics

Standard deviation14.25442928
Coefficient of variation (CV)0.8315455655
Kurtosis10.0194746
Mean17.14209043
Median Absolute Deviation (MAD)6.666666667
Skewness2.247023856
Sum47603.58512
Variance203.1887541
MonotonicityNot monotonic
2021-10-19T23:30:51.768559image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
834
 
1.2%
1333
 
1.2%
932
 
1.2%
1632
 
1.2%
732
 
1.2%
1230
 
1.1%
1429
 
1.0%
629
 
1.0%
1729
 
1.0%
18.529
 
1.0%
Other values (887)2468
88.9%
ValueCountFrequency (%)
0.21
 
< 0.1%
0.253
 
0.1%
0.33333333336
0.2%
0.41
 
< 0.1%
0.40909090911
 
< 0.1%
0.512
0.4%
0.54545454551
 
< 0.1%
0.55555555561
 
< 0.1%
0.57142857141
 
< 0.1%
0.61764705881
 
< 0.1%
ValueCountFrequency (%)
1771
< 0.1%
1051
< 0.1%
1041
< 0.1%
981
< 0.1%
95.51
< 0.1%
94.333333331
< 0.1%
93.333333331
< 0.1%
89.6251
< 0.1%
871
< 0.1%
85.666666671
< 0.1%

item_rp_ratio
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct1239
Distinct (%)44.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.01667525856
Minimum0
Maximum1.551912568
Zeros1484
Zeros (%)53.4%
Negative0
Negative (%)0.0%
Memory size21.8 KiB
2021-10-19T23:30:51.866471image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.008421052632
95-th percentile0.07155197946
Maximum1.551912568
Range1.551912568
Interquartile range (IQR)0.008421052632

Descriptive statistics

Standard deviation0.06610448176
Coefficient of variation (CV)3.964225293
Kurtosis148.6492602
Mean0.01667525856
Median Absolute Deviation (MAD)0
Skewness9.823007514
Sum46.30719302
Variance0.004369802509
MonotonicityNot monotonic
2021-10-19T23:30:51.961108image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01484
53.4%
0.0074626865673
 
0.1%
0.02439024393
 
0.1%
0.0024630541873
 
0.1%
0.012295081973
 
0.1%
0.0023094688223
 
0.1%
0.53
 
0.1%
0.014925373133
 
0.1%
0.013605442183
 
0.1%
0.0096618357493
 
0.1%
Other values (1229)1266
45.6%
ValueCountFrequency (%)
01484
53.4%
0.00011696362431
 
< 0.1%
0.00018399264031
 
< 0.1%
0.00028169014081
 
< 0.1%
0.00031407035181
 
< 0.1%
0.00036192544341
 
< 0.1%
0.00036324010171
 
< 0.1%
0.00036376864311
 
< 0.1%
0.00036710719531
 
< 0.1%
0.0003930817611
 
< 0.1%
ValueCountFrequency (%)
1.5519125681
< 0.1%
11
< 0.1%
0.63333333331
< 0.1%
0.60088365241
< 0.1%
0.59645669291
< 0.1%
0.56488549621
< 0.1%
0.56463878331
< 0.1%
0.56020408161
< 0.1%
0.53990610331
< 0.1%
0.53321033211
< 0.1%

net_margin
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct1297
Distinct (%)46.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.9790239128
Minimum-0.1218700375
Maximum1
Zeros1
Zeros (%)< 0.1%
Negative1
Negative (%)< 0.1%
Memory size21.8 KiB
2021-10-19T23:30:52.061416image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Quantile statistics

Minimum-0.1218700375
5-th percentile0.9044539357
Q10.9846042137
median1
Q31
95-th percentile1
Maximum1
Range1.121870037
Interquartile range (IQR)0.01539578625

Descriptive statistics

Standard deviation0.06471553177
Coefficient of variation (CV)0.06610209509
Kurtosis81.57793872
Mean0.9790239128
Median Absolute Deviation (MAD)1.110223025 × 10-16
Skewness-7.508909849
Sum2718.749406
Variance0.004188100053
MonotonicityNot monotonic
2021-10-19T23:30:52.152895image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11337
48.1%
159
 
2.1%
145
 
1.6%
143
 
1.5%
0.98097273151
 
< 0.1%
0.99639832891
 
< 0.1%
0.95806228951
 
< 0.1%
0.99961497151
 
< 0.1%
0.53578682871
 
< 0.1%
0.98232794061
 
< 0.1%
Other values (1287)1287
46.3%
ValueCountFrequency (%)
-0.12187003751
< 0.1%
01
< 0.1%
0.14017054081
< 0.1%
0.25023409121
< 0.1%
0.35486018641
< 0.1%
0.481
< 0.1%
0.4835800721
< 0.1%
0.48960788521
< 0.1%
0.51552650011
< 0.1%
0.52021552081
< 0.1%
ValueCountFrequency (%)
145
 
1.6%
11337
48.1%
159
 
2.1%
143
 
1.5%
0.99991807691
 
< 0.1%
0.99984316131
 
< 0.1%
0.99972431321
 
< 0.1%
0.99967287611
 
< 0.1%
0.99961497151
 
< 0.1%
0.99954246981
 
< 0.1%

Interactions

2021-10-19T23:30:47.184336image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:25.444391image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.938987image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.316134image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.812081image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.197042image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.749265image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.183014image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.603188image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.093280image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.475935image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.873049image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.216363image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.905697image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.340877image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.757733image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.265311image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:25.546753image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.017966image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.397378image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.898176image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.279939image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.832323image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.267312image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.683101image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.175271image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.558406image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.951731image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.309843image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.996439image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.432620image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.843040image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.345890image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:25.629482image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.098189image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.480456image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.984249image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.366546image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.920637image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.353340image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.762192image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.259020image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.641589image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.032570image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.403080image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.080446image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.518961image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.929813image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.427577image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:25.711058image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.181017image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.567114image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.071608image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.452179image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.010150image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.438102image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.839249image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.340936image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.724719image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.113617image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.491019image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.163978image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.605723image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.015096image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.513489image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:25.796428image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.266171image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.651130image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.160989image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.543202image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.102435image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.527191image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.922043image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.427581image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.812831image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.199743image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.587569image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.252214image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.693438image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.104849image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.600881image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:25.882900image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.352640image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.738026image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.252356image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.636061image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.196429image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.618255image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.006735image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.517226image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.901731image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.286982image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.681709image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.351118image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.784624image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.197968image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.690079image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:25.969182image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.439265image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.829110image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.344483image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.729028image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.289629image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.710930image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.092410image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.607477image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.991849image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.373450image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.999794image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.444111image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.876409image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.291050image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.778567image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.055966image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.528350image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.917054image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.435370image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.821179image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.384356image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.803581image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.179397image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.697351image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.082057image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.460598image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.094853image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.537827image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.967947image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.385688image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.855474image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.132451image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.607513image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.997177image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.514688image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.901850image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.467663image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.887524image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.258302image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.778317image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.164381image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.537774image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.179652image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.616450image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.049501image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.467804image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.940570image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.215568image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.696446image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.084758image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.601369image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.991708image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.557784image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.976732image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.342168image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.869160image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.254359image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.622980image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.273173image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.707040image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.139416image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.559247image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:48.025941image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.423007image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.789524image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.295962image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.688016image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.080848image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.652442image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.066406image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.425285image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.957989image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.342303image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.709850image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.368132image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.797991image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.230120image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.650026image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:48.367568image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.503072image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.870791image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.372805image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.767490image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.160646image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.735716image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.150264image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.501313image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.037652image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.423204image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.787508image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.456814image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.880648image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.312000image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.733737image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:48.454939image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.595145image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:27.962843image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.462199image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.856459image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.252482image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.827084image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.243826image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.586338image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.128105image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.513211image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.876710image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.547993image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:43.974388image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.404592image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.827257image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:48.536448image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.680550image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.050539image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.546917image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:30.939963image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.338670image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:33.915348image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.333348image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.845676image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.214775image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.599447image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:40.959330image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.635124image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.060867image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.491416image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:46.916323image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:48.620693image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.770266image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.142011image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.634092image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.026376image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.427035image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.006784image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.423052image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:36.928035image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.302486image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.693156image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.045277image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.723839image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.146689image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.580054image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.005882image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:48.706729image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:26.859770image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:28.232376image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:29.724623image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:31.114809image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:32.665083image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:34.098517image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:35.517036image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:37.014591image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:38.392486image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:39.786746image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:41.134238image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:42.814747image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:44.256035image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:45.672696image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
2021-10-19T23:30:47.097545image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Correlations

2021-10-19T23:30:52.246243image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2021-10-19T23:30:52.392024image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2021-10-19T23:30:52.537805image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2021-10-19T23:30:52.684892image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2021-10-19T23:30:48.867083image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
A simple visualization of nullity by column.
2021-10-19T23:30:49.061414image/svg+xmlMatplotlib v3.4.3, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

df_indexcustomer_idgross_revenuerecency_daysqnt_purchasesvar_productsqnt_itemsavg_ticketavg_recency_daysfreq_purchaseqtd_returnedfreq_returnsavg_basket_sizeavg_basket_varietyitem_rp_rationet_margin
00178505391.21372.034.0297.01733.018.1522221.00000017.00000040.01.00000050.9705880.6176470.0230810.980973
11130473232.5956.09.0171.01390.018.90403552.8333330.02830235.00.023973154.44444411.6666670.0251800.955611
22125836705.382.015.0232.05028.028.90250026.5000000.04032350.00.105263335.2000007.6000000.0099440.988660
3313748948.2595.05.028.0439.033.86607192.6666670.0179210.00.00000087.8000004.8000000.0000001.000000
4415100876.00333.03.03.080.0292.00000020.0000000.07317122.00.07894726.6666670.3333330.2750000.725000
55152914623.3025.014.0102.02102.045.32647126.7692310.04011529.00.032468150.1428574.3571430.0137960.984472
66146885630.877.021.0327.03621.017.21978619.2631580.057221399.00.019608172.4285717.0476190.1101910.907032
77178095411.9116.012.061.02057.088.71983639.6666670.03352041.00.013072171.4166673.8333330.0199320.987609
881531160767.900.091.02379.038194.025.5434644.1910110.243316474.00.072193419.7142866.2307690.0124100.977808
99160982005.6387.07.067.0613.029.93477647.6666670.0243900.00.00000087.5714294.8571430.0000001.000000

Last rows

df_indexcustomer_idgross_revenuerecency_daysqnt_purchasesvar_productsqnt_itemsavg_ticketavg_recency_daysfreq_purchaseqtd_returnedfreq_returnsavg_basket_sizeavg_basket_varietyitem_rp_rationet_margin
2767561917290525.243.02.0102.0404.05.14941213.00.1428570.00.0202.00000046.0000000.0000001.000000
276856281478577.4010.02.03.084.025.8000005.00.3333330.00.042.0000001.0000000.0000001.000000
2769562917254272.444.02.0112.0252.02.43250011.00.1666670.00.0126.00000050.0000000.0000001.000000
2770564617232421.522.02.036.0203.011.70888912.00.1538460.00.0101.50000015.0000000.0000001.000000
2771564717468137.0010.02.05.0116.027.4000004.00.4000000.00.058.0000002.5000000.0000001.000000
2772565813596697.045.02.0166.0406.04.1990367.00.2500000.00.0203.00000066.5000000.0000001.000000
27735664148931237.859.02.073.0799.016.9568492.00.6666670.00.0399.50000036.0000000.0000001.000000
2774568914126706.137.03.015.0508.047.0753333.00.75000050.01.0169.3333334.6666670.0984250.911489
27755695135211092.391.03.0435.0733.02.5112414.50.3000000.00.0244.333333104.0000000.0000001.000000
2776570515060301.848.04.0120.0262.02.5153331.02.0000000.00.065.50000020.0000000.0000001.000000